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Abstract 

In the recent years, the multilayer networks have increasingly been realized as a more realistic 
framework to understand emergent physical phenomena in complex real world systems. We 
analyze a massive time-varying social data drawn from the largest film industry of the world 
under multilayer network framework. The framework enables us to evaluate the versatility 
of actors, which turns out to be an intrinsic property of lead actors. Versatility in dimers 
suggests that working with different types of nodes are more beneficial than with similar ones. 
However, the triangles yield a different relation between type of co-actor and the success of lead 
nodes indicating the importance of higher order motifs in understanding the properties of the 
underlying system. Furthermore, despite the degree-degree correlations of entire networks being 
neutral, multilayering picks up different values of correlation indicating positive connotations 
like trust, in the recent years. Analysis of weak ties of the industry uncovers nodes from lower 
degree regime being important in linking Bollywood clusters. The framework and the tools used 
herein may be used for unraveling the complexity of other real world systems. 


1 Introduction 

Complex network science revolves around the hypothesis that the behavior of complex systems can 
be explained in terms of the structural and functional relationships between their components by 
means of a graph representation [T], The network framework provides cue into whether the struc¬ 
tural environment confers opportunities for or constraints on individual action |2j. Social network 
formation is a complex process in which individuals create and deactivate social ties in order to si¬ 
multaneously satisfy their goals under multiple (possibly conflicting) constraints. Dynamics of social 
behaviors ranging from opinions, cultural and linguistic traits, crowd behavior, hierarchy formation, 
social spreading and human dynamics and their connections have been investigated using various 
tools of statistical physics [3]. Essential to understanding the behavior of humans within their socio- 
economical environment is the observation that they simultaneously play different roles in various 
interconnected social networks, such as friendship networks, communication networks, family, or 
business networks HIE]. This superposition of a number of networks is often termed as multilayer 
networks W- In a multilayer network, on one hand, actions of individuals help in defining the 
topological structure of networks, and on the other hand, the topology constrains and shapes the 
possible actions of individuals [MO] leading to a spurt in the activities of modeling real world complex 
systems and behavior, for example, multimodal transportation networks mi climatic systems [IT2] , 
the human brain ra and failure and robustness 0. It has recently been realized that neglecting 
the multilayer structure leads to wrong identification of the most versatile nodes, overestimating the 
importance of more marginal agents HE 

In this study, we use a huge empirical social data drawn from the largest him industry in the 
world, Bollywood. This him industry has gained worldwide coverage by selling an estimated 3.6 billion 
tickets as compared to Hollywood’s 2.6 billion as per CBFC 2006 statistics [16]. Furthermore, it is 
said to rehect or affect the decisions and preferences of the populace HE On one hand, Bollywood is 
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driven by the underlying practices prevalent in the society, while on the other hand strongly influences 
the mass, thus acting as a mirror of the society [18]. Moreover, network studies on Bollywood data 
have demonstrated how changes in interactions of the underlying networks are associated with events 
occurring in the society [T9] . Based on all the above, this film industry can be referred to as a model 
to understand and predict various aspects of society based on complex interaction patterns among 
the members. 

We construct the Bollywood networks as follows: the nodes are the actors and interactions between 
a pair of actors is defined if they have co-acted in a movie. We assort the co-actor data comprising of 
movies and their corresponding actors. Owing to the rapidly changing nature of the society [20], we 
divide the data into intervals of five years. This time frame on one hand is small enough to capture 
the minute changes shaping in the society and on the other hand is large enough so as to not miss 
out any prolific change occurring in the society. We take three such datasets in order to see how the 
properties of the networks change with time. Considering genres as layers, we construct multilayer 
networks for each time span. Our analysis using multilayer network approach, on one hand, reveals 
the importance of versatility of the nodes for their success, while on the other hand indicates the 
significance of interactions between all types of nodes in the model system. Emergence of cooperation 
with time is also revealed through our investigations. 


2 Methods 


Curation of data and network construction: We collect the Bollywood data from a reputed 
movie repository website [21]. We extract the names of all the movies, their sequential star cast list 
and genre information from 1998 to 2012 and segregate them into three datasets, each consisting 
of data for five years. We curate the movies and their details under 33 different genres. Many of 
the genres had very few movies and considering them separately would have yielded statistically 
insignificant results. Thus, we combine the genres having very few movies and categorize them as 
‘Others’. This gives us eight broad genres namely ‘Action’, ‘Comedy’, ‘Drama’, ‘Romance’, ‘Thriller’, 
‘Crime and Horror’, ‘Social’ and ‘Others’. We treat each genre as a layer of the multilayer social 
network. For a particular layer, a pair of actors i and j are connected if they have co-acted in a movie 
of the corresponding genre in the respective dataset. Thus, we obtain eight different sub-networks 
for each multilayer network corresponding the three datasets. For each layer cc, the elements of the 
adjacency matrix, A a is given as 


A a - 
Aj - 


1 if i ~ j 
0 otherwise 


( 1 ) 


All the adjacency matrices are symmetric (i.e. = A^). The degree of an actor df is the number of 

actors that actor has worked with, given as df = Afj- Note that not every actor has worked in 
movies of every genre. Thus the number of nodes may differ across the genres. The lead male actors 
can be defined as the actors occupying the first position in the movie star cast providing an easy way 
of segregating lead actors and supporting actors. However, it may be possible that supporting actors 
have acted as lead actors in few movies appearing in the first position of the star cast [21]. Further, 
it is also possible that some actors have acted in only one or two movies, but in lead roles occupying 
the first position in star cast. In order to avoid such cases, we require a threshold for finding lead 
actors, which is a common practice used for construction of networks from real data, for instance in 
construction of gene coexpression networks [22]. Note that a very low value of threshold may bring in 
lot of the supporting actors in the list of lead actors, whereas a very high value of threshold may leave 
behind many known lead actors [39]. After conducting several trials with different threshold values, 


2 


we find five movies to be the optimal criteria for all the three datasets. This broadly divides the 
actors into two types, one consisting of the lead male actors (denoted by L) and the other comprising 
of the rest of the actors, which includes both supporting actors and female actors (denoted by S). 

Filmfare data assimilation: Of the different awards categories introduced over the years of 
cinematic heritage in Bollywood, the Filmfare awards, voted by the public and a committee of experts 
is one of the oldest and the most reputed one [23]. We extract the Filmfare award nominations data 
from the website chronologically [ 23 ] and count the number of times every actor is nominated in each 
five-year span. Instead of the awards bagged we rather take into account the award nominations in 
order to avoid the interplay of some kind of bias affecting the decision. 

Dimers and triangles in one or more layers: If a dimer, a pair of interacting nodes, is 
present in only one of the eight layers, it is defined as a dimer unique to one layer. We such dimers 
are called dimers unique to one layer. If a dimer is present in only any two of the layers, we call 
it a dimer unique to two layers. Similarly, dimers unique to higher layers are defined. Further, if a 
triangle, a complete subgraph of order three, is present in only one of the eight layers, it is defined 
as a triangle unique to one layer. If a triangle is present in only any two of the layers, we call it a 
triangle unique to two layers. In a similar manner, triangles unique to higher layers are defined. Note 
that every lead actor appearing in unique LL and LS dimers or LLL, LLS, LSS and SSS triangles is 
counted only once, even if the lead actor has appeared in more than one unique LL, LS dimer or or 
LLL, LLS, LSS and SSS triangle. 

Structural measures: We quantify the degree-degree correlations of a network by considering 
the Pearson (degree-degree) correlation coefficient, given as [25) 

= [N- 1 E£ dMA - [N-t Eg1 W! + d/Y) 

INc 1 e£i + d} 2 )] - [JV-1 e£i W! + d/y\ ’ 

where d\, d l j are the degrees of nodes at both the ends of the I th connection and N c represents the 
total connections in the network. 

In order to investigate the hypothesis of weak ties on networks [26], we calculate the link be¬ 
tweenness centrality and overlap as follows. Link betweenness centrality for an undirected link e is 
defined as (3l = Yh v &v s ^2 wgv/v a vw{e)/a vw , where a vw (e ) is the number of shortest paths between v 
and w that contain e, and a vw is the total number of shortest paths between v and w. The overlap 
of the neighborhood of two connected nodes i and j is defined as O tJ = ,. _- , where is 

the number of neighbors common to both nodes i and j. Here di and dj represent the degree of the 
i th and j th nodes. 


3 Results 

Structural properties of individual layers: The degree distribution of many of the layers follow 
power law, evaluated using maximum likelihood estimation [27] (values of parameters provided in 
Table SI of [39]). Rest of them resemble power law behavior, but the maximum likelihood method 
rejects the power law hypothesis. Further, we fold that there is drastic increase in the size of the 
networks with time. In fact from 98-02 to 08-12, the size of the network becomes almost double 
(Table Q]). However, despite a drastic increase in size, the average connectivity of the nodes (actors) 
remain almost same across the three time spans (Table CD) indicating the average connectivity as an 
intrinsic property of the model system. 

Further, the average clustering coefficient ((C)), which measures the average connectivity of the 
neighbors [28], depict much higher values for all the three datasets (Table [2]) as compared those of 
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Table 1: The properties of all the three Bollywood datasets. Here N, N c , ( k ) and r represent the 
size, number of connections, average degree and overall assortativity coefficient value of the network, 
respectively. 


Dataset 

N 

N c 

(k) 

r 

08-12 

3934 

59698 

30 

0.04 

03-07 

3041 

55345 

36 

0.07 

98-02 

1899 

44277 

47 

-0.03 


the corresponding random networks (Table S2 of [39]), which is quite expected as various complex 
social and biological networks are known to exhibit high average clustering coefficient |T|. 

(Dis)likelihood in connectivity uncovers cooperation in the recent times: The (dis)assortativity 
has emerged as an important structural measure, used for understanding (dis)likelihood in connec¬ 
tivity in the underlying systems [25]. Various social networks are known to be assortative, while 
many of the biological and technological networks are found to be disassortative [2H]. We calculate 
the values of assortativity coefficient using Eq. [2] for the three networks without consideration of 
genres and find that all the networks exhibit a value of assortativity coefficient close to zero (Table 
[[]). Based on the rapidly changing nature of the society [20] , we may expect some changes reflected 
in the interaction behavior of the model system. But correlation in degrees without consideration of 
genres does not exhibit any change across time. However, in the following, we show how multilayer 
approach demonstrates changes in the interaction patterns with time. The networks corresponding 
to individual layers exhibit different values of assortativity coefficients (Table [2]). The same actors 
working in different genres may contribute differently to the assortativity coefficients of respective 
genres based on the degrees of their co-actors in each genre. Further, we generate an ensemble of 
corresponding random networks of the same size and same degree sequence [30] as of the Bollywood 
layers. The r values of different layers in the three time spans (Table [21) are significantly larger than 
their values in corresponding random networks (Table S3 of [59]). 

We find that the number of genres exhibiting assortative nature increase from the older to the 
latest dataset (Table [2]), though the entire networks without consideration of genres show neutral 
nature of degree correlations as discussed above. This is interesting in the light that networks with 
links having positive connotations like trust and endorsement have been shown to exhibit assortative 
mixing in degrees, while disassortativity has been related with negative connotations like disapproval 
and distrust [3T]. The increase in the number of genres exhibiting assortativity, as we proceed 
from 98-02 to 08-12 dataset, can be related to the enhancement of positive connotations among 
the actors with time. This along with the fact that Bollywood has become more successful in the 
recent years mm might be an indication that trust is beneficial for success. This change in the 
network structure is also revealed in the next section while we are investigating the frequency of 
occurrence of actors in different genres under multilayer framework, relating intrinsic properties with 
their individual success. 

Interplay of versatility and success: The way we construct the multilayer networks allows 
us to capture one more important property, i.e. versatility. Versatility, in the present context, can 
be referred to as the number of genres a particular actor has worked in (151 [22] . It is known to be a 
qualitative feature of actors [33] and entrepreneurs [33] and has been emphasized as a property of the 
ones excelling in multiple dimensions [36]. We find that the number of actors which includes all the 
lead and the supporting actors unique to one or more layers decrease with an increase in the number 
of layers (Fig. [[]). What stands interesting is that the number of lead actors, accounting to ~ 1.2% 
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Figure 1: Actors uniquely common in one or more layers in 08-12, 03-07 and 98-02 datasets. Nj refers 
to the number of actors, uniquely common in one or more layers. %Nl represents the percentage of 
lead actors out of the Nj actors, who are uniquely common in one or more layers. 



Figure 2: The average number of award nominations plotted as a function of number of layers (a) for 
lead actors (represented as L) unique to one or more layers in 08-12, 03-07 and 98-02 datasets; (b), 
(c) and (d) for lead actors participating in unique LL (lead-lead) and LS (lead-supporting) dimers in 
08-12, 03-07 and 98-02 datasets, respectively; (e), (f) and (g) for lead actors participating in unique 
LLL (lead-lead-lead), LLS (lead-lead-supporting) and LSS (lead-supporting-supporting) triangles in 
08-12, 03-07 and 98-02 datasets, respectively. 

of the total actors, that are unique to one or more number of layers consistently increase with an 
increase in the number of layers, which means that versatility is an intrinsic property of lead actors. 
It is noteworthy that none of the lead actors are found to be confined to only one layer (Fig. [T]) 
indicating that versatility is essential for lead actors. Such importance of versatility has also been 
reported in the biological systems, where the essential genes constituting a very small fraction of 
the total human genome encode proteins which have been known to participate in various signaling 
pathways [37], which can be considered different layers. Akin to lead actors from the successful 
Bollywood industry [32], these essential genes are very few in number [33], The importance of these 
essential genes in various signaling pathways can be related to the significance of versatility in the 
success of lead actors. 

While the versatility of lead actors does not show any definite relation with their success, captured 
by the number of award nominations (Fig. [2](a)) , interestingly investigation of versatility in motifs 
exhibit relation with the properties of individual nodes forming these motifs. Since the nodes are of 
two types, namely L and S, various network motifs can be drawn on the basis of different combinations 
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Table 2: The properties of different layers of the Bollywood multilayer networks, 08-12 represented by 
‘i’ superscript, 03-07 represented by ‘ii’ superscript and 98-02 represented by ‘hi’ superscript having 
size 3934, 3041 and 1899 nodes, respectively. Here N c , N, ( k ), ( C) and r, respectively, represent the 
number of connections, number of participating nodes, average degree, average clustering coefficient 
and assortativity coefficient for each layer. Note that the social genre has r value 1 in the 98-02 
dataset owing to only five movies in that genre. None of the actors have acted in more than one 
movie in that genre. This gives rise to five complete subgraphs rendering r value one. 


Layer 

Ab 

N 1 

C 

(ky 

(cy 

r i 

N u 

N u 

±y C 

(k) u 

(C) u 

rpii 

TV™ 

ATiii 
x y c 

(k) iU 

(C) iU 


Social 

1543 

18048 

23 

0.8 

-0.1 

887 

9891 

22 

0.9 

0.1 

65 

632 

19 

1 

l 

Drama 

1375 

14003 

20 

0.9 

0.1 

1456 

21905 

30 

0.8 

-0.1 

1094 

20713 

38 

0.8 

0 

Comedy 

1345 

17232 

26 

0.8 

0 

1013 

13865 

27 

0.8 

-0.1 

550 

8160 

30 

0.8 

0 

Romance 

1121 

9821 

18 

0.9 

-0.1 

1085 

13819 

26 

0.8 

-0.1 

808 

15050 

37 

0.8 

0 

Thriller 

946 

7296 

15 

0.9 

0.1 

766 

7658 

20 

0.8 

-0.1 

392 

4685 

24 

0.8 

0 

Action 

688 

6796 

20 

0.9 

0.1 

672 

9435 

28 

0.8 

-0.1 

666 

12018 

36 

0.8 

0 

Crime 

556 

3985 

14 

0.9 

0.2 

511 

6948 

27 

0.9 

0.2 

455 

6149 

27 

0.8 

0 

Others 

1850 

13400 

15 

0.8 

0.1 

1766 

21139 

24 

0.8 

0.2 

1185 

19940 

34 

0.7 

0 


of these nodes, such as LL and LS dimers, which are motifs of order two and LLL, LLS, LSS and SSS 
triangles, which are motifs of order three. Similar to the versatility of individual nodes, considering 
multilayer network approach, we define the versatility of motifs as the number of layers they appear. 
In the following, we show how existence of two types of nodes in the versatile motifs provides us a 
relation between the success of lead actors and the type of actors they work with. We fold that the 
average number of award nominations of lead actors remains largely independent of the type of node 
they are paired up with (Figsj2)(b), (c) and (d)). This behavior is found to be the same across the 
three datasets. However, when actor dimers are present in more than six layers, the lead actors in 
LS dimers emerge more successful as compared to those in LL dimers (Figs. EJ^b), (c) and (d)). This 
indicates that interactions between dissimilar types of nodes is more beneficial when these interacting 
partners are versatile. Only one LL dimer in 03-07 dataset defies this trend owing to the exceptional 
success of the actor pair in the him industry (discussed in supplementary material [39]). 

Further, we investigate the relation between versatility in triangles and the success of lead nodes. 
We find that the success behavior of lead actors in triangles is different from that of those in dimers. 
As plotted in Figs. [2](e) , (f) and (g), when triangles are present in less number of layers, the average 
number of award nominations of lead actors in unique LLL triangles are higher than those in unique 
LLS and LSS triangles. Although, as discussed, appearing in LL dimers does not considerably affect 
the success of lead actors, appearing in LLL triangles stands beneficial for their success. Furthermore, 
the relation between versatility of triangles and success of lead actors for LLS and LSS triangles is 
different from that followed by LLL triangles (Figs. [21(e) , (f) and (g)). This indicates that the higher 
order interaction patterns can have different behavior as compared to lower order. Note that the 
standard deviations of award nominations are quite high for many of the datasets (Table S4 of [39]), 
indicating the existence of very high as well as very low award nominations in those datasets. These 
large standard deviations indicate that on average, not all the lead actors in LS category are more 
successful than all the lead actors in LL category, but at least one or few of them together in LS 
category have more award nominations than one or few of them together in LL category. Just to 
have a unified measure for assessing the success, we consider the average award nominations. 
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Figure 3: Overlap as a function of link betweenness centrality on a logarithmic scale fitted with a 
straight line for the 08-12 dataset. 


The analysis done here so far reveals the relation of the success behavior of individuals with 
the changes in the interaction patterns between two different types of the nodes. In the following, 
analysis of the properties of links reveals a third category of nodes which help in linking different 
Bollywood clusters. 

Importance of every node: An important proposition of sociology is the ‘Weak ties hypothesis’ 
which was initially proposed by Granovetter [00], We use this concept in the context of networks, 
where the ties having low overlap in their neighborhoods (i.e. less number of common neighbors) are 
termed as the weak ties as per Onnela [26]. These weak ties have high link betweenness centrality and 
are the ones known to bridge different communities [31126]. In our study, we find that the overlap of 
ties exhibits negative correlation with their link betweenness centrality (Fig. [3]), i.e. the ties with less 
overlap have tendency to have high link betweenness centrality. We then investigate the properties of 
the end nodes of the ties having low overlap and high link betweenness centrality and find that these 
nodes are actors who are mostly neither lead actors, defined based on their position of appearance in 
the movie star cast, nor they are popular supporting actors (high degree nodes). In fact, these nodes 
appear in lower degree regime and can be considered to form a third category of the nodes. Details of 
correlation values of overlap and link betweenness centrality, and list of actors forming weak ties with 
high link betweenness centrality along with their degrees are provided in supplementary material [2S| • 
Earlier works [19] having already emphasized the importance of high and moderate degree nodes, this 
result here picks up relatively lower degree nodes which are important in linking different Bollywood 
clusters, reflecting the importance of all types of the nodes. 

4 Conclusion 

Various real world complex systems encompass multiple types of interactions [41] and hence the 
multilayer networks provide a better framework for investigating the underlying properties of such 
systems. We find an increase in the number of assortative genres in the film industry with time 
indicating increase in trust among interacting partners [31] . Since this data is drawn from a system 
which has gained progressive success over the years [32], positive connotations can be considered a 
factor underlying its success. Using the multilayer framework, we evaluate the frequency of occurrence 
of actors in different layers, termed as versatility, which emerges as an intrinsic property of lead actors. 
Furthermore, apart from the existence of various genres representing layers, the presence of two types 
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of nodes provides a richer framework for investigating the versatility of motifs. Our analysis reveals 
that the success behavior of lead actors appearing in unique triangles is different from those working 
in dimers, which suggests that higher order motifs can provide a different insight about a system. The 
results here are drawn based on dimers and triangles as analysis of higher order motifs comprising 
of lead actors would not yield significantly comparable results since the number of lead actors are 
very less. Motifs are known to be the basic building blocks of various social, technological and 
biological networks [32], hence this mode of analysis can be extended to other systems which have 
considerably high number of (dis)similar nodes, for instance predator-prey networks [43] in order 
to identify the factors governing the stability of these natural communities [44]. Furthermore, the 
analysis of the weak ties reveal the involvement of lower degree nodes in holding the system together, 
thus demonstrating the importance of all types of nodes which are defined here based on their degrees. 

While in this study, we segregate the movie co-actor data into layers based on genres, the segrega¬ 
tion of genres based on the type of emotion, either positive or negative, can further yield interesting 
results. Such investigations on the relation between types of neighboring nodes and individual prop¬ 
erties in genres with positive and negative emotions can be one of the aspects of future investigation 
and might attract research in disciplines like psychology [45] . 
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Supplementary Material 


SI Table 1: Significance of power law fitting for all the subnetworks in 08-12, 03-07 and 98-02 datasets. 


Layers 

08-12 

03-07 

98-02 


7 

%min 

p-value 

KS 

A/R 

7 

-Emin 

p-value 

KS 

A/R 

7 

Z'min 

p-value 

KS 

A/R 

Social 

2.88 

25 

0 

0.05 

R 

3.5 

37 

0.04 

0.07 

close 

1.68 

4 

0 

0.28 

R 

Drama 

3.5 

44 

0.36 

0.05 

A 

3.09 

37 

0 

0.06 

close 

2.86 

38 

0 

0.06 

R 

Comedy 

2.67 

30 

0.05 

0.04 

A 

3.2 

45 

0.12 

0.06 

A 

3.5 

59 

0.79 

0.06 

A 

Romance 

2.81 

13 

0 

0.06 

R 

2.65 

20 

0 

0.05 

R 

2.98 

48 

0 

0.08 

R 

Thriller 

3.5 

30 

0.05 

0.08 

A 

3.5 

29 

0.1 

0.06 

A 

3.5 

40 

0.53 

0.06 

A 

Action 

3.5 

37 

0.3 

0.06 

A 

3.5 

50 

0.07 

0.08 

A 

3.23 

53 

0 

0.08 

R 

Crime 

2.93 

11 

0 

0.06 

R 

3.5 

49 

0.1 

0.13 

A 

3.5 

40 

0.03 

0.08 

close 

Others 

3.5 

52 

0.1 

0.08 

A 

3.5 

65 

0.02 

0.07 

close 

3.5 

100 

0.13 

0.07 

A 


We evaluate the significance of power law in the different Bollywood subnetworks of three time 
spans by using maximum-likelihood fitting methods with goodness-of-fit tests based on the Kolmogorov- 
Smirnov (KS) statistic and likelihood ratios. The power-law hypothesis is accepted if the p-value is 


> 0.05. 


SI Table 2: Average c 


ustering coefficient va 

ues for corresponding 

Layer 

(C)X -12 

(C) 03-07 

(C) 98-02 

Social 

0.02 

0.02 

0.3 

Drama 

0.01 

0.02 

0.04 

Comedy 

0.02 

0.03 

0.06 

Romance 

0.02 

0.02 

0.05 

Thriller 

0.02 

0.03 

0.06 

Action 

0.03 

0.04 

0.05 

Crime 

0.03 

0.05 

0.06 

Others 

0.01 

0.01 

0.03 


Threshold for enlisting lead actors: A very low value of threshold brings in lot of the sup¬ 
porting actors in the list of lead actors, whereas a very high value of threshold may leave behind 
many known lead actors. For 08-12 dataset, decreasing the threshold from 5 to 4 movies increases 
the count of lead actors from 44 to 62, while increasing the threshold to 6 movies decreases the count 
to 27. For 03-07 dataset, decreasing the threshold from 5 to 4 movies increases the count of lead 
actors from 33 to 45, while increasing the threshold to 6 movies decreases the count to 24. For 98-02 
dataset, decreasing the threshold from 5 to 4 movies increases the count of lead actors from 19 to 27, 
while increasing the threshold to 6 movies decreases the count to 15. 

Exception in LL dimers: Note that in the 03-07 dataset, there is only one LL actor pair 
Amitabh Bachchan - Abhishek Bachachan the father-son duo who have appeared in all the eight 
genres. It has already been discussed in our earlier work that Amitabh Bachchan has always had a 
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SI Table 3: Pearson (degree-degree) correlation coefficient values for corresponding random networks 
having the same degree sequence as of the Bollywood networks, known as configuration model. 


Layer 

conj 

r 08-12 

conj 

r 03-07 

conf 

r 98-02 

Social 

0.02 

-0.01 

-0.05 

Drama 

-0.03 

0.01 

0.02 

Comedy 

0.02 

-0.02 

0.02 

Romance 

-0.01 

-0.03 

0.01 

Thriller 

-0.03 

-0.03 

0.04 

Action 

-0.02 

0 

-0.02 

Crime 

0.02 

0.02 

-0.03 

Others 

-0.02 

0.01 

0 


SI Table 4: Standard deviation of award nominations for dimers and triangles in 08-12, 03-07 and 
98-02 datasets._ 


NLayers 

08-12 

03-07 

98-02 


LL 

LS 

LLL 

LLS 

LSS 

LL 

LS 

LLL 

LLS 

LSS 

LL 

LS 

LLL 

LLS 

LSS 


1 

1.2 

1.1 

2.9 

2.8 

2.8 

2.9 

2.8 

1.2 

1.1 

1.1 

2.6 

2.4 

2.5 

2.4 

2.4 

2 

1.1 

1.1 

2.9 

2.8 

2.8 

2.9 

2.8 

1.2 

1.1 

1.1 

2.5 

2.4 

2.6 

2.4 

2.4 

3 

1.2 

1.2 

3.1 

2.9 

2.8 

2.9 

2.8 

1.4 

1.2 

1.1 

2.2 

2.4 

1.5 

2.5 

2.4 

4 

1.3 

1.2 

3.3 

3.1 

3.1 

3 

2.9 

0.9 

0.9 

1.2 

2.5 

2.4 

3 

2.6 

2.4 

5 

1.6 

1.2 

0 

3.6 

3.5 

3.6 

3.1 

0.6 

0.8 

0.7 

2.6 

2.5 

- 

2.6 

2.4 

6 

0.8 

1.4 

- 

2.8 

3.9 

2.7 

3.4 

- 

- 

- 

- 

2.6 

- 

0.7 

2.6 

7 

- 

0.7 

- 

- 

0 

0 

3.7 

- 

- 

- 

0.7 

2.1 

- 

0.7 

2.6 

8 

- 

- 

- 

- 

- 

2.8 

- 

- 

- 

- 

- 

- 

- 

- 

- 


SI Table 5: High degree nodes in 08-12 dataset. The table shows the top two degree nodes naming 
hub actors in each layer followed by their ranking in multiplex network ( R ne w ) and the networks 


studied before 


Raid)- 


Layer 

Hub Actors 

F^new 

F^old 

Action 

Mushtaq Khan, Yashpal Sliarma 

1,2 

4,8 

Crime & horror 

Zakir Hussain, Murli Sharma 

1,2 

11, 10 

Comedy 

Rajpal Yadav, Manoj Joshi 

1,2 

6, 12 

Drama 

Anupam Kher, Boman Irani 

1,2 

1, 27 

Other 

Govind Namdev, Jackie Shroff 

1,2 

7, 13 

Romance 

Anupam Kher, Boman Irani 

1,2 

1, 27 

Social 

Rajpal Yadav, Anupam Kher 

1,2 

6, 1 

Thriller 

Zakir Hussain, Gulshan Grover 

1,2 

11, 20 


successful realm and visibly stands out of the domain of our analysis and defies the common trends. 
Taking this aspect into consideration, we do not consider this LL actor pair in our analysis. 
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SI Table 6: High degree nodes in 03-07 dataset. The table shows the top two degree nodes naming 
hub actors in each layer followed by their ranking in multiplex network (R new ) and the networks 
studied before ( R 0 id )• 


Layer 

Hub Actors 

R"new 

F^old 

Action 

Gulshan Grover, Sunicl Shetty, Rajpal Yadav 

1,2,2 

4, 5, 3 

Crime & horror 

Pratima Kazmi, Suresh Dubey 

1,2 

34, 199 

Comedy 

Rajpal Yadav, Asrani 

1,2 

3, 26 

Drama 

Amitabh Bachchan, Om Puri 

1,2 

1, 12 

Other 

Rani Mukherjee, Amitabh Bachchan 

1,2 

25, 1 

Romance 

Satisli Shah, Sliarat Saxena 

1,2 

21, 7 

Social 

Anupam Kher, Vivek Shauq 

1,2 

2, 19 

Thriller 

Gulshan Grover, Priyanka Chopra 

1,2 

4, 37 


SI Table 7: High degree nodes in 98-02 dataset. The table shows the top two degree nodes naming 
hub actors in each layer followed by their ranking in multiplex network (R new ) and the networks 
studied before (R 0 id)- 


Layer 

Hub Actors 

Rnew 

Rold 

Action 

Johny Lever, Mohan Joshi 

1,2 

1,8 

Crime & horror 

Johny Lever, Razzak Khan 

1,2 

1,3 

Comedy 

Johny Lever, Razzak Khan 

1,2 

1,3 

Drama 

Johny Lever, Kulbhushan Kharbanda 

1,2 

1, 16 

Other 

Shakti Kapoor, Anil Nagrath 

1,2 

2,5 

Romance 

Johny Lever, Alok Nath 

1,2 

1, 10 

Social 


1,2 


Thriller 

Johny Lever, Dinesh Hingoo 

1,2 

1,4 


SI Table 8: Pair-wise analysis of actors uniquely common to one, two or more layers in 08-12, 03-07 
and 98-02 datasets. N pa i r denotes the number of pairs that are uniquely common to one or more 
layers and r pair gives the average of the degree-degree correlation of these actor pairs. Pll, Pls 
and Pss represent the number of lead-lead, lead-supporting and supporting-supporting actor pairs 
uniquely common to one, two or more layers. 


Layers 

7W-U8-12 

v pair 

pUH-12 

r LL 

7YTU8-UV 

1 v pair 

p(J3—UY 
r LL 

v pair 

P 98-U2 

r LL 

1 

33696 

111 

25692 

101 

42616 

13 

2 

20542 

96 

16415 

54 

10680 

17 

3 

3718 

43 

8707 

36 

7056 

5 

4 

901 

19 

2514 

35 

2986 

9 

5 

158 

8 

1668 

13 

1322 

7 

6 

34 

4 

180 

7 

391 

- 

7 

7 

- 

69 

1 

119 

1 

8 

- 

- 

7 

1 

- 

- 


13 



























SI Table 9: Number of cliques (A) unique to one, two or more layers for 08-12 dataset and A j denotes 
participating nodes. A LLL , A LL $, A L g$ and A S ss denote number of cliques unique to LLL, LLS, 
LSS and SSS cliques, respectively. 


Layers 

n a 

A/ 

A LLL 

A lls 

A LSS 

A sss 

1 

2,03,253 

2983 

206 

5187 

43412 

154448 

2 

1,23,177 

1719 

86 

2438 

25851 

94802 

3 

13,525 

566 

4 

200 

2545 

10776 

4 

2,118 

171 

2 

52 

401 

1663 

5 

67 

276 

1 

15 

23 

28 

6 

- 

- 

- 

- 

- 

- 

7 

1 

3 

- 

- 

- 

1 

8 

- 

- 

- 

- 

- 

- 


SI Table 10: Number of cliques unique in one, two or more layers for 03-07 dataset. The terminologies 


are the same as usee 


in Tabic l9l 


Layers 

n a 

A/ 

A LLL 

A lls 

A LSS 

A sss 

1 

2,10,962 

2216 

786 

9497 

48548 

152131 

2 

1,24,310 

1338 

111 

2219 

19454 

102526 

3 

57,531 

818 

34 

742 

7690 

49065 

4 

12,710 

361 

19 

412 

2973 

9306 

5 

12,035 

202 

1 

63 

1841 

10130 

6 

136 

50 

- 

4 

57 

75 

7 

42 

17 

- 

- 

3 

39 

8 

3 

1 

- 

- 

- 

1 


SI Table 11: Number of cliques unique in one, two or more layers for 98-02 dataset. The terminologies 


are the same as usee 


in Tabic l9l 


Layers 

n a 

A/ 

A LLL 

A lls 

A LSS 

A sss 

1 

2,27,222 

1470 

22 

1579 

33159 

192462 

2 

1,07,170 

933 

6 

714 

16184 

90266 

3 

70,134 

627 

1 

281 

9740 

60112 

4 

17,659 

344 

2 

130 

3341 

14186 

5 

4,836 

223 

- 

48 

1008 

3780 

6 

358 

109 

- 

1 

66 

291 

7 

83 

35 

- 

1 

10 

72 

8 

- 

- 

- 

- 

- 

- 
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03-07 


98-02 



SI Figure 1: Overlap versus link betweenness centrality on a logarithmic scale for 03-07 and 98-02 
datasets. 


SI Table 12: List of actor-pairs having high link betweenness centrality but low overlap for 08-12 
dataset. _ 


Layer 

ft L~0 cor r 

Links (High low O ) 

Action 

-0.48 

Arnit Belli (107) - Dalip Tahil (246) 

Comedy 

-0.43 

Aslam Khan (55) - Ankita Shrivastava (10) 

Drama 

-0.58 

Om Puri (418) - Emil Marwa (16) 

Thriller 

-0.46 

Rahul Bose (111) - Nandita Das (43) 

Social 

-0.50 

Naseeruddin Shah (254) - Iman Ali (15) 

Romance 

-0.49 

Anushka Sharma (46) - Parineeta Chopra (24) 

Crime and Horror 

-0.44 

Hemant Pandey (195) - Paintal (72) 

Others 

-0.57 

Anil Nagrath (134) - Vinod Tripathi (23) 


SI Table 13: List of actor-pairs having high link betweenness centrality but low overlap for 03-07 
dataset. _ 


Layer 

ftL~0 C or r 

Links (High /3l low O ) 

Action 

-0.52 

Gulshan Grover (389) - Govinda (158) 

Comedy 

-0.009 

Dclnaz Paul (74) - Rajpal Yadav (434) 

Drama 

-0.41 

Esha Deol (196) - Meera Jasmine (22) 

Thriller 

-0.55 

Gulshan Grover (389) - Manisha Koirala (116) 

Social 

-0.59 

Yashpal Sharma (256) - Kashish Duggal (22) 

Romance 

-0.36 

Dinesli Hingoo (264) - Shweta Menon (158) 

Crime and Horror 

-0.64 

Zakir Hussain (186) - Dilip Prabhawalkar (119) 

Others 

-0.41 

Cheran (7) - Prakash Raj (57) 
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SI Table 14: List of actor-pairs having high link betweenness centrality but low overlap for 98-02 
dat aset. _ 


Layer 

ft L~0 cor r 

Links (High f} L low O ) 

Action 

-0.44 

Mukesli Rislii (254) - Soundarya (18) 

Comedy 

-0.25 

Simran (97) - Nagesh (23) 

Drama 

-0.34 

Tabu (229) - Sukumari (43) 

Thriller 

-0.60 

Johny Lever (571) - Sanjay Mishra (63) 

Social 

- 

- 

Romance 

-0.37 

Ameesha Patel (114) - Pawan Kalyan (13) 

Crime and Horror 

-0.60 

Anil Nagrath (405) - Paresh Rawal (331) 

Others 

-0.39 

Tabu (229) - Gajraj Rao (64) 
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